Reception apparatus, transmission apparatus, and data processing method

ABSTRACT

Provided are an apparatus and a method that are capable of performing reliable display control with no caption display interruption of the TTML caption data. Specifically, when a set value of the caption display end time specification attribute of the TTML caption data is the continuous display instruction value, memory storing and display processing of caption data currently displayed are continued even after reproduction end timing of an image file, and switching processing of TTML caption data files is performed. When a set value of the caption display start time specification attribute of the TTML caption data is the continuous display instruction value, whether or not there is a text string, display processing of the text string is continued, is determined, and when there is the text string, the displayed text is continuously displayed. With the processing, it is possible to achieve display control with no caption display interruption of the TTML caption data.

TECHNICAL FIELD

The present disclosure relates to a reception apparatus, a transmissionapparatus, and a data processing method. In more detail, the presentdisclosure relates to a reception apparatus, a transmission apparatus,and a data processing method that transmit/receive caption data andperform processing on the caption data.

BACKGROUND ART

There exist various methods for superimposing captions on a picture anddisplaying them. TTML (Timed Text Markup Language) is increasingly used.In the TTML, the caption data in the XML (Extensible Markup Language)format is used, and the display area or display timing can becontrolled.

The TTML has features of being capable of cooperating with a webapplication such as HTML5.

Note that the TTML is described in, for example, Patent Document 1(Japanese Patent Application Laid-open No. 2012-169885).

Patent Document 1: Japanese Patent Application Laid-open No. 2012-169885

SUMMARY Problem to be Solved

The TTML caption data that is the above-mentioned caption data has aconfiguration in which information representing the display timing ofthe caption as well as the caption data to be displayed can bedescribed. However, when switching processing of TTML files isperformed, display of the caption data may be interrupted.

It is an object of the present disclosure to provide a receptionapparatus, a transmission apparatus, and a data processing method thatachieve processing capable of eliminating caption display interruptionin the caption display processing using control information of the TTMLcaption data.

Means for Solving the Problem

A reception apparatus according to a first aspect of the presentdisclosure includes:

a communication unit that receives TTML (Timed Text Markup Language)caption data; and

a data processing unit that performs caption display control, the TTMLcaption data being applied to the caption display control, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

A transmission apparatus according to a second aspect of the presentdisclosure includes:

a data processing unit that generates TTML (Timed Text Markup Language)caption data; and

a communication unit that transmits the TTML caption data, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

A data processing method according to a third aspect of the presentdisclosure, which is executed by a reception apparatus, includes:

receiving, by a communication unit, TTML (Timed Text Markup Language)caption data;

performing, by a data processing unit, caption display control, the TTMLcaption data being applied to the caption display control, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

A data processing method according to a fourth aspect of the presentdisclosure, which is executed by a transmission apparatus, includes:

generating, by a data processing unit, TTML (Timed Text Markup Language)caption data; and

transmitting, by a communication unit, the TTML caption data, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

These and other objects, features and advantages of the presentdisclosure will become more apparent in light of the following detaileddescription on the basis of the embodiments of the present disclosure tobe described later or the accompanying drawings. Note that, the “system”in this specification refers to a logical collective configuration of aplurality of apparatuses, and those apparatuses having respectiveconfigurations are not necessarily provided in the same casing.

Effects

According to the configuration of an embodiment of the presentdisclosure, it is possible to achieve an apparatus and a method that arecapable of performing reliable display control with no caption displayinterruption of the TTML caption data.

Specifically, when a set value of the caption display end timespecification attribute of the TTML caption data is the continuousdisplay instruction value, memory storing and display processing ofcaption data currently displayed are continued even after reproductionend timing of an image file, and switching processing of TTML captiondata files is performed. In the case where a set value of the captiondisplay start time specification attribute of the TTML caption data isthe continuous display instruction value, whether or not there is a textstring of which display processing is continued is determined, and whenthere is the text string, the displayed text is continuously displayed.

According to this configuration, it is possible to achieve displaycontrol with no caption display interruption of the TTML caption data.

Note that the effects described in the specification are merelyexamples. The effects are not limited to the effects described in thespecification. Further, the present disclosure may have effects otherthan the exemplified effects.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 A diagram explaining a configuration example of a communicationsystem for performing processing according to the present disclosure.

FIG. 2 A diagram explaining transmission data transmitted by atransmission apparatus.

FIG. 3 A diagram explaining the caption display processing using theTTML caption data.

FIG. 4 A diagram explaining the caption display processing using theTTML caption data.

FIG. 5 A diagram explaining caption Interruption occurred in the captiondisplay processing using the TTML caption data.

FIG. 6 A diagram explaining caption interruption occurred in the captiondisplay processing using the TTML caption data.

FIG. 7 A diagram explaining caption interruption occurred in the captiondisplay processing using the TTML caption data.

FIG. 8 A diagram explaining the caption display control processing usingthe TTML caption data in which the caption interruption is preventedfrom occurring.

FIG. 9 A diagram explaining the caption display control processing usingthe TTML caption data in which the caption interruption is preventedfrom occurring.

FIG. 10 A diagram showing a flowchart explaining the sequence of thecaption display control processing using the TTML caption data in whichthe caption interruption is prevented from occurring.

FIG. 11 A diagram showing a flowchart explaining the sequence of thecaption display control processing using the TTML caption data in whichthe caption interruption is prevented from occurring.

FIG. 12 A diagram showing a flowchart explaining the sequence of thecaption display control processing using the TTML caption data in whichthe caption interruption is prevented from occurring.

FIG. 13 A diagram explaining configuration examples of a transmissionapparatus and a reception apparatus.

FIG. 14 A diagram explaining a configuration example of the transmissionapparatus.

FIG. 15 A diagram explaining a configuration example of the receptionapparatus.

FIG. 16 A diagram explaining hardware configuration examples of thetransmission apparatus and the reception apparatus being communicationapparatuses.

DESCRIPTION OF PREFERRED EMBODIMENTS

Hereinafter, details of the reception apparatus, the transmissionapparatus, and the data processing method according to the presentdisclosure will be described with reference to the drawings. Note thatdescriptions will be made in the following order.

1. Regarding Configuration Example of Communication System

2. Regarding Outline of Caption Data

3. Regarding Problems Caused when TTML Caption

Data Files Are Switched

4. Regarding Embodiment for Eliminating Interruption of Caption

5. Regarding Sequence of Caption Display Control

6. Regarding Configuration Examples of

Transmission Apparatus and Reception Apparatus

7. Summary of Configuration of Present Disclosure

1. Regarding Configuration Example of Communication System

A configuration example of a communication system for performingprocessing according to the present disclosure will be described firstwith reference to FIG. 1.

As shown in FIG. 1, a communication system 10 includes a transmissionapparatus 20 that transmits image data, audio data, caption data, anddata such as various kinds of control information, and a receptionapparatus 30 that receives the data transmitted from the transmissionapparatus 20.

The transmission apparatus 20 is specifically an apparatus that providescontent, such as a broadcasting station 21 and a content server 22.

On the other hand, the reception apparatus 30 is a client apparatus of ageneral user, and is specifically constituted of a television 31, a PC32, a portable terminal 33, or the like.

The data communication performed between the transmission apparatus 20and the reception apparatus 30 uses any or both of bidirectionalcommunication or unidirectional communication via a network such as theInternet and unidirectional communication through a broadcast wave orthe like.

The data transmission from the transmission apparatus 20 to thereception apparatus 30 is performed in various formats such as MPEG-2TS,MP4, and MMT (MPEG Media Transport). Note that in the case where theprocessing according to the present disclosure is performed, the datadelivery format is not limited.

In any of the MPEG-2TS, MP4, and MMT formats, encoded data being dataconstituting content such as an image (Video), audio, and captions, andthe format for transmitting the control information thereof arespecified.

The transmission apparatus 20 encodes content data, generates a datafile including the encoded data and metadata of the encoded data, storesthe generated encoded data in a packet, and transmits it through abroadcast wave or via a network.

The data provided from the transmission apparatus 20 to the receptionapparatus 30 includes guide information such as program guide,notification information, and control information, as well asreproduction target data such as an image, audio, and captions.

An example of the data transmitted from the transmission apparatus 20will be described with reference to FIG. 2.

In FIG. 2, the kind of data transmitted from the transmission apparatus20 to the reception apparatus 30 is shown. The data transmitted from thetransmission apparatus 20 to the reception apparatus 30 includes thefollowing data:

(a) Image data (Video) 50;

(b) Audio data (Audio) 60;

(c) Caption data (Subtitle) 70; and

(d) Signaling data 80.

These pieces of data are transmitted as a payload of the packetspecified in a communication format.

Note that the signaling data is notification information or controlinformation for the reception apparatus 30, and includes, for example,setting information, data attribute information, and controlinformation. The setting information is necessary for receiving,decoding, and reproducing the image data, audio data and caption data.

2. Regarding Outline of Caption Data

In the configuration of the present disclosure, the TTML (Timed TextMarkup Language) is used. In the TTML, the caption data in the XML(Extensible Markup Language) format is used, and the display area ordisplay timing can be controlled.

An example of the caption display control using the TTML caption datawill be described.

FIG. 3 is a diagram showing a configuration example of the TTML captiondata in the XML format stored in caption data (Subtitle) 70 transmittedfrom the transmission apparatus 20 to the reception apparatus 30, and anexample of caption display performed in the reception apparatus 30 usingthe TTML caption data.

In the (A) the caption data (Subtitle) 70 transmitted from thetransmission apparatus 20 to the reception apparatus 30, for example,the TTML caption data shown in the (B) part of FIG. 3 is stored.

The TTML caption data is, for example, data in the XML format shown inthe (B) part of FIG. 3, and has a configuration in which information ondisplay area, display timing, or the like can be described in additionto the text string, which is actually displayed.

The reception apparatus 30 performs processing of interpreting such TTMLcaption data, generate the display data as shown in the (C) part of FIG.3, i.e., display data obtained by superimposing captions on image data,which is acquired from another packet and is decoded, and display it ona display unit.

In the TTML caption data shown in the (B) part of FIG. 3, the followinginformation:

   <div   regon=“r1”>   <p begin =“00:00:01.878” end=“00:00:05.334”>    Hello   <p begin =“00:00:08.608” end=“00:00:15.296”>     Thank you   </p>  </div? is stored.

With this TTML data, it is possible to achieve control of displaying thecaption data, two text strings of “Hello” and “Thank you,” in an area ata time specified therefor.

<div regon=“r1”> represents control information for displaying thecaption in a particular display area [r1] of the display screen.

<p begin=“00:00:01.878” end=“00:00:05.334”> represents controlinformation that specifies a display start time and a display end timefor the caption data, “Hello.”

It is control information for displaying the caption data, “Hello,” at

the display start time of 00 hour 00 minute 01 second 878 and

the display end time of 00 hour 00 minute 05 second 334.

Note that the time can be measured on the basis of a system clock of thereception apparatus 30.

<p begin=“00:00:08.608” end=“00:00:15.296”> represents controlinformation that specifies a display start time and a display end timefor the caption data, “Thank you.”

It is control information for displaying the caption data, “Thank you,”at

the display start time of 00 hour 00 minute 08 second 608 and

the display end time of 00 hour 00 minute 15 second 296.

Specifically, this TTML caption data includes control information forperforming the following caption display.

The caption, “Hello,” is displayed in the display area [r1] from thetime of 0 hour 00 minute 01 second 878 to the time of 00 hour 00 minute05 second 334.

Thereafter, the caption, “Thank you,” is displayed in the display area[r1] from the time of 0 hour 00 minute 08 second 608 to the time of 00hour 00 minute 15 second 296.

The control information is for performing the above-mentioned captiondisplay.

The display results based on this TTML data are shown as display data ofthe (C1) and (C2) parts of FIG. 3.

The caption data is superimposed on the image data for which the displaytime is specified, and is displayed.

The (C1) part of FIG. 3 represents an example of the display imageobtained by superimposing the caption, “Hello,” on the image, andcorresponds to the display data displayed from the time of 0 hour 00minute 01 second 878 to the time of 00 hour 00 minute 05 second 334.

The (C2) part of FIG. 3 is the display data displayed after the displaydata of (C1) part of FIG. 3, and represents an example of the displayimage obtained by superimposing the caption, “Thank you,” on the image.It corresponds to the display data displayed from the time of 0 hour 00minute 08 second 608 to the time of 00 hour 00 minute 15 second 296.

In FIG. 3, an example of the TTML caption data using an attribute(begin) that represents the display start time of the caption data, andan attribute (end) that represents the display end time is shown. Inaddition thereto, there exists TTML caption data using an attribute(begin) that represents the display start time of the caption data andan attribute (dur) that represents the display time period (duration),for example, as shown in FIG. 4.

In the TTML caption data shown in the (B) part of FIG. 4, the followinginformation:

   <div   regon=“r1”>   <p begin =“00:00:01.878” dur=“03.456”>     Hello  <p begin =“00:00:08.608” dur=“06.668”>     Thank you    </p>  </div?is stored.

Also with this TTML data, it is possible to achieve control ofdisplaying the caption data, two text strings of “Hello” and “Thankyou,” at a time specified therefor, similarly to the example shown inFIG. 3.

<div regon=“r1”> represents control information for displaying thecaption in the particular display area [r1] of the display screen.

<p begin=“00:00:01.878” dur=“03.456”> represents control informationthat specifies a display start time and a display time period for thecaption data, “Hello.”

It is control information for displaying the caption data, “Hello,” at

the display start time of 00 hour 00 minute 01 second 878 for

the display time period of 03 seconds 456.

Note that the time can be measured on the basis of a system clock of thereception apparatus 30.

<ID begin=“00:00:08.608” dur=“06.668”> represents control informationthat specifies a display start time and a display time period for thecaption data, “Thank you.”

It is control information for displaying the caption data, “Thank you,”at

the display start time of 00 hour 00 minute 08 second 608 for

the display time period of 06 seconds 688.

Specifically, this TTML caption data includes control information forperforming the following caption display.

The caption, “Hello,” is displayed in the display area [r1] from thetime of 0 hour 00 minute 01 second 878 for 03 seconds 456.

Thereafter, the caption, “Thank you,” is displayed in the display area[r1] from the time of 0 hour 00 minute 08 second 608 for 06 seconds 688.

The control information is for performing the above-mentioned captiondisplay.

The display results based on this TTML data are shown as display data ofthe (C1) and (C2) parts of FIG. 4.

The caption data is superimposed on the image data for which the displaytime is specified, and is displayed.

The (C1) part of FIG. 4 represents an example of the display imageobtained by superimposing the caption, “Hello,” on the image, andcorresponds to the display data displayed from the time of 0 hour 00minute 01 second 878 for 03 seconds 456.

The (C2) part of FIG. 4 is the display data displayed after the displaydata of (C1) part of FIG. 4, and represents an example of the displayimage obtained by superimposing the caption, “Thank you,” on the image.It corresponds to the display data displayed from the time of 0 hour 00minute 08 second 608 for 06 seconds 688.

As described above with reference to FIGS. 3 and 4, the TTML captiondata is generated as data including a text string to be the caption dataand control information on display timing, display area, or the like ofthe text string.

Note that it is possible to set various kinds of control information inaddition to those described in FIGS. 3 and 4.

3. Regarding Problems Caused when TTML Caption Data Files are Switched

As described above with reference to FIGS. 3 and 4, the TTML captiondata includes a text string to be the caption data and controlinformation on display timing, display area, or the like of the textstring.

However, when the processing using display timing control informationincluded in the TTML data is performed, the displayed text string maydisappear, i.e., the caption interruption may occur when the TTML datafiles are switched.

This problem will be described with reference to FIG. 5.

FIG. 5 shows the following two pieces of information:

(a) data file sequentially received by the reception apparatus 30 fromthe transmission apparatus 20; and

(b) caption data display sequence performed by the reception apparatus30.

As shown by the time axis (t) in the (b) part of FIG. 5, time increasesfrom left to right.

In the example shown in FIG. 5, a processing example in the case wherethe reception apparatus 30 receives, from the transmission apparatus 20,image/audio data files 101 to 103 and three TTML caption data files(TTML-1 to 3) 111 to 113 for performing processing synchronized withthese data files is shown. The reception apparatus 30 uses the receivedthree caption data files 111 to 113 to perform the caption displayprocessing.

Note that the image/audio data files 101 to 103 are each a data file inunits of transfer or processing defined in, for example, a data transfermethod such as MPEG-2TS, MP4, and MMT.

In the case where the data communication format is MP4, for example, theimage/audio data files 101 to 103 each correspond to the encoded datafile “mdat” defined as units of processing in MP4.

On the other hand, in the case where the data communication format isMMT, the image/audio data files 101 to 103 each correspond to theencoded data file “MFU” defined as units of processing in MMT.

In these data files, an image and audio reproduced for a predeterminedtime period are stored. Note that although each of the image/audio datafiles 101 to 103 is shown as one file in FIG. 5, the image file and theaudio file are transferred as an individual file in some cases.

In the image/audio data files 101 to 103, data for reproduction isstored to prevent interruption of data in a processing time period fromoccurring, taking into account the processing time period when files areswitched.

On the other hand, the caption data files 111 to 113 are each TTMLcaption data similar to those described above with reference to FIGS. 3and 4.

For each of the caption data files 111 to 113, a time stamp is set toperform processing synchronized with the image/audio data files 101 to103.

The reception apparatus 30 is capable of performing control ofsynchronizing the image, audio, and captions with each other byperforming processing referring to the time stamp.

By referring to the time stamp, it is possible to determine which of thecaption data files 111 to 113 is associated with any of the image/audiodata files 101 to 103 for performing processing.

As shown in the (a) part of FIG. 5, in the TTML caption data file(TTML-1) 111, the following caption display control data:

the display start time (begin) of 00:00:01.878, and

the display end time (end) of 00:00:05.334 regarding a text string (text1); and

the display start time (begin) of 00:00:08.608, and

the display end time (end) of 00:00:15.296 regarding a text string (text2) is stored.

As shown in the (b) part of FIG. 5, a caption data processing unit ofthe reception apparatus 30 uses the TTML caption data file (TTML-1) 111to perform caption display of the text string 1 and the text string 2.

Specifically, display of the text string 1 is started at a time t1 of00:00:01.878 and finished at a time t2 of 00:00:05.334.

Further, display of the text string 2 is started at a time t3 of00:00:08.608 and finished at a time t4 of 00:00:15.296.

In the next TTML caption data file (TTML-2) 112, the following captiondisplay control data:

the display start time of 00:00:16.278, and

the display end time of 00:00:24.556 regarding a text string (text 3);and

the display start time of 00:00:25.008, and

the display end time of 00:00:36.178 regarding a text string (text 4) isstored.

The caption data processing unit of the reception apparatus 30 switchesprocessing targets to the TTML caption data file (TTML-2) 112 after thedisplay processing of the TTML caption data file (TTML-1) 111, and usesthe TTML caption data file (TTML-2) 112 to perform caption display ofthe text string 3 and the text string 4 as shown in the (b) part of FIG.5.

Specifically, display of the text string 3 is started at a time t5 of00:00:16.278 and finished at a time t6 of 00:00:24.556.

Further, display of the text string 4 is started at a time t7 of00:00:25.008.

However, a time t9 of 00:00:36.178 being the display end time of thetext string 4 is set to be later than the image reproduction end time(t8=00:00:35.442) of the synchronization processing data file associatedwith the TTML caption data file (TTML-2) 112, i.e., the image/audio datafile 102.

In the example shown in the figure, the image reproduction end time ofthe image/audio data file 102 is the t8 of 00:00:35.442.

The reception apparatus 30 switches processing to the next image/audiodata file 103 when the reproduction end time (t8=00:00:35.442) of theimage/audio data file 102 being the main data has come. During the fileswitching, the reception apparatus 30 switches also caption data files.

Interruption of the image/audio data does not occur because reproductiondata taking into account the processing time period for the fileswitching is stored in the previous file to prevent data interruptionfrom occurring.

However, the TTML caption data only includes control information servingas the display start time and the display end time or the display timeperiod, and does not take into account the data continuity through fileswitching of image data or the like.

The caption data processing unit of the reception apparatus 30 stops,when the TTML caption data files are switched, display of the captiondata currently displayed. Specifically, the caption data processing unitdeletes the caption data for display from the memory that stores thedata, stops the caption display, and processes the next TTML captiondata.

As a result, the display of the caption (text 4) of the TTML captiondata file (TTML-2) 112 shown in FIG. 5 is interrupted.

Specifically, the caption data processing unit switches caption datafiles together with image/audio data files, and performs, when the fileswitching processing is performed, processing of deleting the displaytext data from the memory and stop the display.

As described above, in the case where the display end time of thecaption (text 4) of the TTML caption data file (TTML-2) 112 is set to belater than the reproduction end time of the image/audio data file 102being the main data, the file switching forces display of the displaytext data to quit and thus, the text display disappears.

Specifically, during the time period from the t8 to the time t9 shown inthe (B) part of FIG. 5, caption interruption display occurs.

The state where interruption of text display occurs in the example shownin FIG. 5 will be described in detail.

The time t8=00:00.35.442 shown in the (b) part of FIG. 5 corresponds tothe reproduction end time of the image/audio data file 102 being themain data.

In the TTML caption data file (TTML-2) 112, the following display timingcontrol information

the display start time of 00:00:25.008, and

the display end time of 00:00:36.178 regarding the text string 4 isstored.

The display end time=00:00:36.178 of the text string 4 is set to belater than the reproduction end time (t8=00:00:35.442) of theimage/audio data file 102 being the main data, which is set as asynchronization reproduction target with the TTML-2.

As described above, even when the display end time stored in the TTMLcaption data is set to be later than the reproduction end time of theimage/audio data file being the main data, the caption data processingunit of the reception apparatus 30 switches caption data files insynchronization with the image/audio data file switching. This fileswitching causes caption interruption display to occur during the timeperiod from the time t8 to the time t9 shown in the (B) part of FIG. 5.

Thereafter, the caption data processing unit of the reception apparatus30 starts processing for the next caption data file (TTML-3) 113.

In the next TTML caption data file (TTML-3) 113, the following captiondisplay control data

the display start time of 00:00:36.178, and

the display end time of 00:00:50.856 regarding the text string (text 4)is stored.

As shown in the (b) part of FIG. 5, in accordance with this displaycontrol information, the caption data processing unit of the receptionapparatus 30 starts display of the text string 4 at the time t9 of00:00:36.178 and finishes the display at a time t10 of 00:00:50.856.

The control information on the caption data file (TTML-2) 112 and thecaption data file (TTML-3) 113 is originally the following information:control information for continuously displaying the text string (text 4)during the time period from the display start time (t7) to the displayend time (t10).

However, by switching the caption data files at the reproduction endtime (t8=00:00:35.442) of the image/audio data file 102 being thesynchronization processing data file, the above-mentioned captioninterruption occurs during the time period from the time t8 to the timet9.

Note that it is possible not to record a value (time information) in theattribute (end) of the caption display end time in the TTML captiondata. In the case where a value (time information) is not recorded inthe attribute (end), caption display is performed on the caption untilthe processing end time of the caption data file.

Specifically, it is setting shown in the caption data file (TTML-2) 112in FIG. 6.

The caption data file (TTML-2) 112 shown in FIG. 6 includes thefollowing information:

  <p begin=“00:00:16.278” end=“00:00:24.556” > text 3</p> <pbegin=“00:00:25.008”> text 4</p>.

The above-mentioned caption data file (TTML-2) 112 includes thefollowing information:

the display start time of 00:00:16.278, and

the display end time of 00:00:24.556 regarding the text string (text 3);and

the display start time of 00:00:25.008 regarding the text string (text4).

The display end time is not recorded for the text string (text 4). Inthis case, the caption data processing unit that performs the captiondisplay control performs caption display on the text string (text 4)until the processing end time of the caption data file.

In this case, however, the processing end time of the caption data file(TTML2) 112 is the image reproduction end time (t8=00:00:35.442) of theimage/audio file 102 being the synchronization processing data. At thispoint, the TTML caption data files are switched. After all, the displayof the text string (text 4) is interrupted during the file switchingprocessing.

As described above with reference to FIG. 4, it is possible to use theattribute (dur) that represents the display time period (duration)instead of the attribute (end) of the caption display end time in theTTML caption data.

An example using the attribute (dur) that represents the display timeperiod (duration) is shown in FIG. 7.

The caption data file (TTML-2) 112 shown in FIG. 7 includes thefollowing information:

  <p begin=“00:00:16.278”  dur=“08.278”> text 3</p> <pbegin=“00:00:25.008”  dur=“11.170”> text 4</p>.

The above-mentioned caption data file (TTML-2) 112 includes thefollowing information:

the display start time of 00:00:16.278, and

the display time period of 08.278 seconds regarding the text string(text 3); and

the display start time of 00:00:25.008, and

the display time period of 11.170 seconds regarding the text string(text 4).

Also in this case, however, display interruption of the text string(text 4) occurs when the caption data files are switched similarly tothe case described with reference to FIGS. 5 and 6.

In accordance with the control information on the caption data file(TTML-2) 112, display of the text string 4 is started at the displaystart time of 00:00:25.008.

The time when the display time period of 11.170 seconds is finished isthe time t9=00:00:36.178 shown in the (b) part of FIG. 7.

The time t9 is set to be later than the reproduction end time(t8=00:00:35.442) of the image/audio data file 102 being the main data,which is set as a synchronization reproduction target with the TTML-2.

As described above, even when the display end time recorded in the TTMLcaption data is set to be later than the reproduction end time of theimage/audio data file being the main data, the caption data processingunit of the reception apparatus 30 switches caption data files insynchronization with the image/audio data file switching. This fileswitching causes caption interruption display to occur during the timeperiod from the time t8 to the time t9 shown in the (b) part of FIG. 7.

As described above, caption interruption occurs regardless of the kindof control information recorded in the TTML caption data.

4. Regarding Embodiment for Eliminating

Interruption of Caption

In the following, an embodiment for solving the above-mentioned problemof caption interruption will be described.

FIG. 8 is a diagram showing a processing example in the receptionapparatus 30.

Similarly to the processing examples described with reference to FIGS. 5to 7, FIG. 8 shows the following two pieces of information:

(a) data file sequentially received by the reception apparatus 30 fromthe transmission apparatus 20; and

(b) caption data display sequence performed by the reception apparatus30.

As shown by the time axis (t) in the (b) part of FIG. 8, time increasesfrom left to right.

In the example shown in FIG. 8, a processing example in the case wherethe reception apparatus 30 receives, from the transmission apparatus 20,the image/audio data files 101 to 103 and three TTML caption data files(TTML-1 to 3) 121 to 123 for performing processing synchronized withthese data files is shown. The reception apparatus 30 uses the receivedthree caption data files 121 to 123 to perform the caption displayprocessing.

Note that the image/audio data files 101 to 103 are each a data file inunits of transfer or processing defined in, for example, a data transfermethod such as MPEG-2TS, MP4, and MMT.

In the case where the data communication format is MP4, for example, theimage/audio data files 101 to 103 each correspond to the encoded datafile “mdat” defined as units of processing in MP4.

On the other hand, in the case where the data communication format isMMT, the image/audio data files 101 to 103 each correspond to theencoded data file “MFU” defined as units of processing in MMT.

In these data files, an image and audio reproduced for a predeterminedtime period are stored. Note that although each of the image/audio datafiles 101 to 103 is shown as one file in the figure, the image data fileand the audio data file may be set as an individual data file.

In the image/audio data files 101 to 103, data for reproduction isstored to prevent interruption of data during the file switching timeperiod from occurring, taking into account the processing time periodwhen files are switched.

On the other hand, the caption data files 121 to 123 are each TTMLcaption data similar to those described above with reference to FIGS. 3and 4.

For each of the caption data files 121 to 123, a time stamp is set toperform processing synchronized with the image/audio data files 101 to103.

The reception apparatus 30 is capable of synchronizing the image data,audio data, and caption data with each other and outputting it byperforming processing referring to the time stamp of each data file ofthe image, audio, and captions. By referring to the time stamp, thereception apparatus 30 is capable of determining which of the captiondata files 121 to 123 is associated with any of the image/audio datafiles 101 to 103 for performing processing.

As shown in the (a) part of FIG. 8, in the TTML caption data file(TTML-1) 121, the following caption display control data:

the display start time of 00:00:01.878, and

the display end time of 00:00:05.334 regarding the text string (text 1);and

the display start time of 00:00:08.608, and

the display end time of 00:00:15.296 regarding the text string (text 2)is stored.

This is the control information similar to the TTML caption data file(TTML-1) 111 described above with reference to FIG. 5.

As shown in the (b) part of FIG. 8, the caption data processing unit ofthe reception apparatus 30 uses the TTML caption data file (TTML-1) 121to perform caption display of the text string 1 and the text string 2.

Specifically, the display of the text string 1 is started at the time 1of 00:00:01.878 and finished at the time t2 of 00:00:05.334.

Further, the display of the text string 2 is started at the time t3 of00:00:08.608 and finished at the time t4 of 00:00:15.296.

In the next TTML caption data file (TTML-2) 122, the following data:

   <p begin=“00:00:16.278”  end=“00:00:24.556” >  text 3</p>  <pbegin=“00:00:25.008”  end=“indefinite” >  text 4</p> is stored.

This data represents that:

the display start time is 00:00:16.278, and

the display end time is 00:00:24.556 regarding the text string (text 3);and

the display start time is 00:00:25.008, and

the display end time is the continuous display instruction value(indefinite) regarding the text string (text 4).

This data is different from the TTML caption data file (TTML-2) 112described above with reference to FIG. 5 in that the display end time ofthe text string (text 4) is not determined (indefinite). This displayend time being the continuous display instruction value (indefinite)represents control information for switching files while continuingdisplay of the text being a display target.

The image/audio data processing unit of the reception apparatus 30switches processing to the next image/audio data file 103 when thereproduction end time (t8=00:00:35.442) of the image/audio data file 102being the main data, which is set as a synchronization processing targetwith the TTML caption data file (TTML-2) 122, has come.

On the other hand, in the case where the display end time is set to thecontinuous display instruction value (indefinite) as control informationrelating to the text currently displayed on the basis of the TTMLcaption data being a processing target, the caption data processing unitcontinues the text display using the data stored in the memory withoutdeleting the text string in the memory, which stores the displayed textstring on the display unit. The caption data processing unit switchesfiles in parallel with the text display continuing processing.

In the example shown in FIG. 8, similarly to the examples describedabove with reference to FIGS. 5 to 7, the reproduction end time(t8=00:00:35.442) of the image/audio data file 102 being the main data,which is set as a synchronization processing target with the TTMLcaption data file (TTML-2) 122, comes while the text string (text 4) ofthe TTML caption data file (TTML-2) 122 is displayed.

The caption data processing unit performs the following processing onthe basis of confirmation of the text display end time of the textstring (text 4) of the TTML caption data file (TTML-2) 122 being set tothe continuous display instruction value (indefinite).

The caption data processing unit continues the text display using thedata (text 4) stored in the memory without deleting the text string inthe memory, which stores the displayed text string (text 4) on thedisplay unit.

Further, the caption data processing unit switches files, i.e., performsfile switching from the TTML caption data file (TTML-2) 122 to the TTMLcaption data file (TTML-3) 123 in parallel, while continuing the textdisplay.

The next TTML caption data file (TTML-3) 123 includes the followinginformation:

  <p begin=“indefinite8” end=“00:00:50.856” > text 4</p>.

This data represents that regarding the text string (text 4):

the display start time is the continuous display instruction value(indefinite); and

the display end time is 00:00:50.856.

At the time of the file switching, the caption data processing unit ofthe reception apparatus 30 continues the memory storing and display ofthe text data (text 4) in accordance with control information (displayend time of the text string (text 4) being the continuous displayinstruction value (indefinite)) on the TTML caption data file (TTML-2)122 being the previous processing file.

The caption data processing unit switches the processing target file tothe TTML caption data file (TTML-3) 123, and detects, from the TTMLcaption data file (TTML-3) 123 after the switching, the followingcontrol information of the text string (text 4):

the display start time of the continuous display instruction value(indefinite); and

the display end time of 00:00:50.856.

The caption data processing unit confirms that the text display starttime of the text string (text 4) of the TTML caption data file (TTML-3)123 is set to the continuous display instruction value (indefinite).

On the basis of this confirmation, the caption data processing unitdetermines whether or not

(a) text string (text 4) for which the display start time is set to thecontinuous display instruction value (indefinite) and

(b) text string that is stored in the memory and is continuouslydisplayed match.

In the case where the (a) text string and the (b) text string match, thetext string currently displayed is stored in the memory as it is, anddisplay of the text string is continued.

It should be noted that the text display control processing is performedon the basis of the file after the file switching, i.e., the TTMLcaption data (TTML-3) 123.

By this processing target file switching, display of the text string(text 4) is continued until the display end time of t10 being00:00:50.856 recorded in the TTML caption data file (TTML-3) 123, andfinished at the time t10.

By performing such processing, as shown in the (b) part of FIG. 8, thetext string (text 4) is continuously displayed without interruptionduring the file switching.

Note that the above-mentioned (a) and (b) do not match in some cases.

For example, the reception apparatus that receives a program (content)in the middle receives the caption data file (TTML-3) 123 withoutreceiving the caption data file (TTML-2) 122 shown in FIG. 8, and theprogram is viewed.

In this case, the (a) text string (text 4) for which the display starttime is set to the continuous display instruction value (indefinite) canbe acquired from the caption data file (TTML-3) 123.

However, the (b) text string that is stored in the memory and iscontinuously displayed does not exist. Specifically, there is no textstring that is stored in the memory and is continuously displayed.

In this case, the caption data processing unit of the receptionapparatus 30 performs processing in accordance with only the captiondata file (TTML-3) 123.

The caption data processing unit detects, from the TTML caption datafile (TTML-3) 123, the following control information of the text string(text 4):

the display start time of the continuous display instruction value(indefinite); and

the display end time of 00:00:50.856.

In accordance with the detection information, the caption dataprocessing unit immediately displays the text string (text 4), andfinishes the display at the display end time of 00:00:50.856.

As described above, in the case where there is no text string that isstored in the memory and is continuously displayed, the caption dataprocessing unit performs processing of immediately displaying thespecified text string in accordance with the control information inwhich the display start time is the continuous display instruction value(indefinite).

Note that in the embodiment described with reference to FIG. 8, only anexample in which only any of the display start time and the display endtime is the continuous display instruction value (indefinite) is shown.

However, there exists control information in which both of the displaystart time and the display end time are the continuous displayinstruction value (indefinite), i.e.,

the display start time is the continuous display instruction value(indefinite), and

the display end time is the continuous display instruction value(indefinite).

In this case, on the basis of the control information in which thedisplay start time is the continuous display instruction value(indefinite), processing of comparing

the (a) text string for which the display start time is set to thecontinuous display instruction value (indefinite) and

the (b) text string that is stored in the memory and continuouslydisplayed will be performed first.

In this comparison processing, only in the case where the (a) is the(b), the text being continuously displayed is continuously displayed asit is and the processing target file is switched.

Further, in accordance with the control information in which the displayend time is the continuous display instruction value (indefinite),switching processing to the next TTML caption data file is performedwhile continuing the text display.

By performing such processing, it is possible to continuously displaythe same caption data by applying three or more TTML caption data files.

As described above, in this embodiment, in the case where the displayend time of the continuous display instruction value (indefinite) isrecorded as control information on a particular text string (text x)stored in the TTML caption data file, the caption data processing unitcontinues the display without deleting the data for displaying theparticular text string (text x) from the memory that store the data.

During the display being continued, the caption data processing unitswitches the processing target file.

Further, in the case where the display start time of the continuousdisplay instruction value (indefinite) is recorded as controlinformation on the particular text string (text x) stored in the TTMLcaption data file after the switching, the data for displaying theparticular text string (text x) is stored in the memory, and the displayis performed, the display is continued.

While continuing the display, the caption data processing unit startsprocessing on the basis of the processing target file after theswitching.

By performing such processing, as shown in FIG. 8, the text string (text4) is continuously displayed during the time period from the time t7 tothe time t10 without interruption at the end of the image reproductiontime (t8) of the image/audio data file.

In the example shown in FIG. 8, it is possible to display the textstring (text 4) without display interruption during the time period fromthe display start time specified in the caption data file (TTML-2) 122to the display end time specified in the caption data file (TTML-3) 122.The display start time is the time t7=00:00:25.008, and the display endtime is the time t10=00:00:50.856.

The example shown in FIG. 8 is a processing example using the attribute(end) that represents the caption display end time. Even when theattribute (dur) that represents the display time period (duration)described above with reference to FIG. 7 is used, it is possible tocontinuously perform the caption display similar to that shown in FIG.8.

FIG. 9 shows an embodiment using the attribute (dur) that represents thedisplay time period (duration).

In a TTML caption data file (TTML-2) 132 shown in FIG. 9, the followingdata:

   <p begin=“00:00:16.278”  dur=“08.278” >  text 3</p>  <pbegin=“00:00:25.008”  dur=“indefinite” >  text 4</p> is stored.

This data represents that:

the display start time is 00:00:16.278, and

the display time period is 08.278 seconds regarding the text string(text 3); and

the display start time is 00:00:25.008, and

the display time period is the continuous display instruction value(indefinite) regarding the text string (text 4).

The processing for dur=“indefinite”> being the control information onthe text string (text 4) is similar to the processing forend=“indefinite”> being the control information described with referenceto FIG. 8.

Specifically, the display time period being the continuous displayinstruction value (indefinite) is the control information for switchingfiles while continuing the display of the display target text.

The image/audio data processing unit of the reception apparatus 30switches processing to the next image/audio data file 103 when thereproduction end time (t8=00:00:35.442) of the image/audio data file 102being the main data, which is set as a synchronization processing targetwith the TTML caption data file (TTML-2) 122 has come.

On the other hand, in the case where the display time period (dur) isset to the continuous display instruction value (indefinite) as controlinformation relating to the text currently displayed on the basis of theTTML caption data being a processing target, the caption data processingunit continues the text display using the data stored in the memorywithout deleting the text string in the memory, which stores thedisplayed text string on the display unit. The caption data processingunit switches files in parallel with the text display continuingprocessing.

In the example shown in FIG. 9, similarly to the examples describedabove with reference to FIGS. 5 to 7, the reproduction end time(t8=00:00:35.442) of the image/audio data file 102 being the main data,which is set as a synchronization processing target with the TTMLcaption data file (TTML-2) 132, comes while the text string (text 4) ofthe TTML caption data file (TTML-2) 132 is displayed.

The caption data processing unit performs the following processing onthe basis of confirmation of the display time period (duration) of thetext string (text 4) of the TTML caption data (TTML-2) 122 being set tothe continuous display instruction value (indefinite).

The caption data processing unit continues the text display using thedata (text 4) stored in the memory without deleting the text string inthe memory, which stores the displayed text string (text 4) on thedisplay unit.

Further, the caption data processing unit switches files, i.e., performsfile switching from the TTML caption data file (TTML-2) 132 to a TTMLcaption data file (TTML-3) 133 in parallel, while continuing the textdisplay.

The next TTML caption data file (TTML-3) 133 includes the followinginformation:

  <p begin=“indefinite8” end=“00:00:50.856” > text 4</p>.

This data represents that: the display start time is the continuousdisplay instruction value (indefinite) and the display end time is00:00:50.856 regarding the text string (text 4).

At the time of the file switching, the caption data processing unit ofthe reception apparatus 30 continues the memory storing and display ofthe text data (text 4) in accordance with control information (displaytime period of the text string (text 4) being the continuous displayinstruction value (indefinite)) on the TTML caption data file (TTML-2)132 being the previous processing file.

The caption data processing unit switches the processing target file tothe TTML caption data file (TTML-3) 133, and detects, from the TTMLcaption data file (TTML-3) 133 after the switching, the followingcontrol information of the text string (text 4):

the display start time of the continuous display instruction value(indefinite); and

the display end time of 00:00:50.856.

The caption data processing unit confirms that the text display starttime of the text string (text 4) of the TTML caption data (TTML-3) 123is set to the continuous display instruction value (indefinite).

Based on this confirmation, the caption data processing unit determineswhether or not

the (a) text string (text 4) for which the display start time is set tothe continuous display instruction value (indefinite) and

the (b) text string that is stored in the memory and is continuouslydisplayed match.

In the case where the (a) text string and the (b) text string match, thetext string currently displayed is stored in the memory as it is, anddisplay of the text string is continued.

It should be noted that the text display control processing is performedon the basis of the file after the file switching, i.e., the TTMLcaption data (TTML-3) 133.

By this processing target file switching, display of the text string(text 4) is continued until the display end time of t10 being00:00:50.856 recorded in the TTML caption data file (TTML-3) 133, andfinished at the time t10.

By performing such processing, as shown in the (b) part of FIG. 9, thetext string (text 4) is continuously displayed without interruptionduring the file switching.

As described above, it is possible to continuously perform captiondisplay without display interruption along with the file switching, evenwhen any of the attribute (end) that represents the caption display endtime and the attribute (dur) that represents the display time period(duration) is used.

Note that in the embodiment described with reference to FIGS. 8 and 9,an example in which the continuous display instruction value set foreach attribute (end, begin, or dur) is [indefinite] as the controlinformation for continuously performing caption display has beendescribed. However, this value is only an example, and it is possible tocontinuously perform caption display similar to the above-mentionedprocessing by using, for example, [00:00:00.000] or another value as thecontinuous display instruction value.

In the above-mentioned embodiment, an example in which values set forattributes (end, begin, and dur) are all [indefinite] has beendescribed. However, it is possible to set a different value for eachattribute, and perform processing similar to the above-mentionedprocessing.

5. Regarding Sequence of Caption Display Control

Next, the sequence of caption display control performed in the receptionapparatus 30 will be described with reference to FIGS. 10 to 13.

The flowchart shown in FIG. 10 is a flow explaining the processingsequence relating to the caption display control performed in thereception apparatus 30.

The processing in accordance with the flow shown in FIG. 10 is performedin accordance with a program stored in a storage unit under the controlof the data processing unit including a CPU having a program executingfunction in the reception apparatus, for example.

Hereinafter, the processing of each Step will be described.

(Step S101)

First, the reception apparatus receives data including image data, audiodata, and caption data in Step S101.

The received data includes the TTML caption data.

(Step S102)

Next, the reception apparatus extracts a TTML caption data file from thereceived data.

As described above with reference to FIGS. 3 and 4 and the like, in theTTML caption data file, the displayed text string and controlinformation such as display timing corresponding to the text string arestored.

(Step S103) The caption data processing unit of the reception apparatusselects, from the TTML caption data file, the caption data being adisplay target.

The caption data processing unit selects the caption data being aprocessing target in the order of recorded data of the TTML captiondata, and starts the caption display processing in accordance with thecontrol information.

(Step S104)

Next, the caption data processing unit performs processing correspondingto the display end time specification attribute (end attribute or durattribute) that has specified the display end time of the display targetcaption data in Step S104.

The detailed processing of Step S104 will be described with reference tothe flow shown in FIG. 11.

(Step S201)

First, the caption data processing unit determines whether or not thedisplay end time specification attribute (end attribute or durattribute) of the display target text string is the continuous displayinstruction value (indefinite) in Step S201.

In the case where the attribute is the continuous display instructionvalue (indefinite), the processing proceeds to Step S202. In the casewhere the attribute Is not the continuous display instruction value(indefinite), the processing proceeds to Step S203.

(Step S202)

In the case where it is confirmed that the display end timespecification attribute (end attribute or dur attribute) of the displaytarget text string is the continuous display instruction value(indefinite), the reception apparatus performs, in Step S202, thefollowing processing: storing the display target caption data in thememory even after the end of delivery data unit (image/audio data filebeing the main data) and continuing the display processing of the textstring stored in the memory.

The caption data processing unit performs switching processing of theTTML caption data files together with switching processing of theimage/audio data files being the main data. Also during the fileswitching processing, the memory storing and display of the text string(text x) of which the display end time specification attribute (endattribute or dur attribute) is confirmed to be the continuous displayinstruction value (indefinite) are continued.

(Step S203)

On the other hand, in the case where it is confirmed that the displayend time specification attribute (end attribute or dur attribute) of thedisplay target text string is not the continuous display instructionvalue (indefinite), the reception apparatus performs, in Step S203, thefollowing processing: finishing the display of the display targetcaption data in accordance with the specification by the display endtime specification attribute (end attribute or dur attribute).

The processing of Steps S201 to S203 is performed as the processing ofStep S104 shown in the flow of FIG. 10.

Thereafter, the processing proceeds to Step S105.

(Step S105)

The caption data processing unit of the reception apparatus performsprocessing corresponding to the display start time specificationattribute (begin attribute) that specifies the display start time of thedisplay target caption data in the next Step S105.

The detailed processing of Step S105 will be described with reference tothe flow shown in FIG. 12.

(Step S301)

First, the caption data processing unit determines whether or not thedisplay start time specification attribute (begin attribute) of thedisplay target text string is the continuous display instruction value(indefinite) in Step S301.

In the case where the attribute is not the continuous displayinstruction value (indefinite), the processing proceeds to Step S302. Inthe case where the attribute is the continuous display instruction value(indefinite), the processing proceeds to Step S303.

(Step S302)

In the case where the display start time specification attribute (beginattribute) of the display target text string is not the continuousdisplay instruction value (indefinite), the caption data processing unitperforms, in Step S302, the following processing: display control forstarting display of the display target caption data at the timespecified by the display start time specification attribute (beginattribute).

(Step S303)

On the other hand, in the case where the display start timespecification attribute (begin attribute) of the display target textstring is the continuous display instruction value (indefinite), thecaption data processing unit performs, in Step S303, the followingprocessing: determining whether or not the same text data as the textstring for which the display start time specification attribute (beginattribute) is the continuous display instruction value (indefinite) isstored in the memory and continuously displayed.

In the case where the same text as the text being a processing target isstored in the memory and continuously displayed, the processing proceedsto Step S304, otherwise, the processing proceeds to Step S305.

(Step S304)

In the case where the same text data as the text string for which thedisplay start time specification attribute (begin attribute) is thecontinuous display instruction value (indefinite) is stored in thememory and continuously displayed, the caption data processing unitperforms, in Step S304, the following processing: continuing the displayof the caption that is stored in the memory and currently displayed.

(Step S305)

On the other hand, in the case where the same text data as the textstring for which the display start time specification attribute (beginattribute) is the continuous display instruction value (indefinite) isnot stored in the memory and continuously displayed, the caption dataprocessing unit performs, in Step S305, the following processing:immediately displaying the display target caption data.

Note that the processing performed in this case corresponds to theprocessing in the case where the reception apparatus receives thecaption data file (TTML-3) 123 without receiving the caption data file(TTML-2) 122 and the program (content) is viewed in the middle in theexample shown in FIG. 8, for example, as described above.

The processing of Steps S301 to S305 is performed as the processing ofStep S105 shown in the flow of FIG. 10.

Thereafter, the processing proceeds to Step S106.

(Step S106)

In Step S106, the caption data processing unit of the receptionapparatus determines whether or not there is the next display targetcaption data, and repeats the processing of Step S103 and subsequentSteps when there is the next display target caption data. When there isno next display target caption data, the caption data processing unitfinishes the processing.

Note that in the flow shown in FIG. 10, the processing of Steps S103 toS106 is for one TTML caption data file, and is repeatedly performedevery time a new TTML caption data file is received and extracted inSteps S102 and S102.

By performing the processing described with reference to FIGS. 10 to 13,it is possible to continuously perform caption display without captioninterruption display when the TTML caption files are switched asdescribed above with reference to FIGS. 8 and 9.

6. Regarding Configuration Examples of Transmission Apparatus andReception Apparatus Next, configuration examples of the transmissionapparatus 20 and the reception apparatus 30 being communicationapparatuses will be described with reference to FIG. 13 and subsequentfigures.

FIG. 13 shows configuration examples of the transmission apparatus 20and the reception apparatus 30.

The transmission apparatus 20 includes a data processing unit 141, acommunication unit 142, and a storage unit 143.

The reception apparatus 30 includes a data processing unit 151, acommunication unit 152, a storage unit 153, an input unit 154, and anoutput unit 155.

The data processing unit 141 of the transmission apparatus 20 performsvarious kinds of data processing for performing a data delivery service.For example, the data processing unit 141 controls the generation ortransmission of the data constituting the data delivery service.

Specifically, the data processing unit 141 generates a packet storingthe image/audio data, generates a packet storing the caption data andother signaling data, and performs delivery processing.

Note that the packet storing the caption data is a packet storing theTTML caption data in the XML format.

The communication unit 142 performs transmission processing of thecontrol signal on the image, audio, caption, other signaling data, andthe like, generated by the data processing unit 141.

The storage unit 143 stores, for example, the data of the image, audio,and caption being a delivery target.

Further, the storage unit 143 is used as a work area for the dataprocessing performed by the data processing unit 141, and also as astorage area for various parameters.

On the other hand, the reception apparatus 30 includes a data processingunit 151, a communication unit 152, a storage unit 153, an input unit154, and an output unit 155.

The communication unit 152 receives the data delivered from thetransmission apparatus 20, e.g., packet storing an image, audio, andcaptions and signaling data.

The caption data is the TTML caption data.

The data processing unit 151 performs the display control of the captiondata in accordance with the above-mentioned embodiment, for example.

Specifically, for example, the data processing unit 151 performs thedisplay control of the caption in accordance with the TTML caption data.

A user's instruction command, e.g., content specifying command, is inputvia the input unit 154.

The reproduction data is output to the output unit 155 such as a displayunit and a speaker.

The storage unit 153 stores the data of the received image, audio,caption, and the like.

Further, the storage unit 153 is used as a work area for the dataprocessing performed by the data processing unit 151, and also as astorage area for various parameters.

FIG. 14 is a diagram showing the detailed configuration of the dataprocessing unit 141 of the transmission apparatus 20.

The data processing unit 141 includes an image data processing unit 201,an audio data processing unit 202, a caption data processing unit 203, acontrol signal generation unit 204, and a packet generation unit 205.

The image data processing unit 201 performs processing on the image databeing the data constituting the content to be transmitted to thereception apparatus, e.g., processing of generating data fortransmission such as encoding processing, generates encoded image datato be stored in a packet, and supplies it to the packet generation unit205.

The audio data processing unit 202 performs processing on the audio databeing the data constituting the content to be transmitted to thereception apparatus, e.g., processing of generating data fortransmission such as encoding processing, generates encoded audio datato be stored in a packet, and supplies it to the packet generation unit205.

The caption data processing unit performs processing on the caption databeing the data constituting the content to be transmitted to thereception apparatus.

Specifically, the caption data processing unit performs processing ofgenerating the TTML caption data, and supplies it to the packetgeneration unit 205.

The control signal generation unit 204 generates a control signalincluding program information, for example, and outputs it to the packetgeneration unit 205.

The packet generation unit 205 generates the packet storing the image,audio, caption, and control signal, and supplies it to the communicationunit 142.

Note that in the packet generated by the packet generation unit 205, apacket identifier depending on the data type of the image, audio,caption, and the like, is set.

The communication unit 142 outputs, via a broadcast wave or a network,the packet input from the packet generation unit 205.

FIG. 15 is a diagram showing the detailed configuration of the dataprocessing unit 151 of the reception apparatus 30.

The data processing unit 151 of the reception apparatus 30 includes apacket demultiplexing unit 301, an image data processing unit 302, anaudio data processing unit 303, a caption data processing unit 304, acontrol unit 305, and a superimposition processing unit 306.

The packet demultiplexing unit 301 demultiplexes the received packet foreach data type (image, audio, caption, and control signal) on the basisof the packet identifier of the packet received via the communicationunit 152, and supplies the demultiplexed packet to each data processingunit.

The image data processing unit 302 acquires image data from the packetstoring the image data, and performs processing necessary forreproducing the image such as decoding processing.

The audio data processing unit 303 acquires the audio data from thepacket storing the audio data, and performs processing necessary forreproducing the audio such as decoding processing.

The caption data processing unit 304 acquires the TTML caption data fromthe packet storing the data relating to captions, and controls theoutput timing of the caption to output the caption by using the controlinformation recorded in the TTML caption data.

In addition to the above-mentioned output of the control signal relatingto captions, the control unit 305 controls processing performed in theprocessing units of the image data processing unit 302, the audio dataprocessing unit 303, the caption data processing unit 304, and thesuperimposition processing unit 306.

The superimposition processing unit 306 performs superimpositionprocessing of the image data output from the image data processing unit302 and the caption data output from the caption data processing unit304, and outputs it to the display unit.

FIG. 16 shows a configuration example of hardware of a communicationapparatus applicable to the transmission apparatus 20 and the receptionapparatus 30.

A CPU (Central Processing Unit) 401 functions as a data processing unitthat performs various kinds of processing in accordance with a programstored in a ROM (Read Only Memory) 402 or a storage unit 408. Forexample, the CPU 401 performs the processing in accordance with thesequence described in the above-mentioned embodiment. A RAM (RandomAccess Memory) 403 stores the program executed by the CPU 401, data, andthe like. The CPU 401, the ROM 402, and the RAM 403 are connected toeach other by a bus 404.

The CPU 401 is connected to an input/output interface 405 via the bus404. To the input/output interface 405, an input unit 406 includingvarious switches, a keyboard, a mouse, a microphone, or the like, and anoutput unit 407 including a display, a speaker, or the like, areconnected. The CPU 401 performs various kinds of processing in responseto the command input from the input unit 406, and outputs the processingresult to the output unit 407, for example.

The storage unit 408 connected to the input/output interface 405includes, for example, a hard disk, and stores the program executed bythe CPU 401 and various kinds of data. A communication unit 409functions as a transmission/reception unit of data communication via anetwork such as the Internet and a local area network and as atransmission/reception unit of a broadcast wave, and communicates withan external apparatus.

A drive 410 connected to the input/output interface 405 drives aremovable medium 411 such as a magnetic disk, an optical disc, amagneto-optical disk, and a semiconductor memory including a memory cardto store or read data.

Note that although encoding or decoding of data can be performed as theprocessing performed by the CPU 401 serving as the data processing unit,it is possible to provide a codec serving as dedicated hardware forperforming encoding processing or decoding processing.

7. Summary of Configuration of Present Disclosure

In the above, the embodiment of the present disclosure has beendescribed with reference to a particular embodiment. However, it isobvious that various modifications or substitutions can be made to theembodiment of the present disclosure by a person skilled in the artwithout departing from the gist of the present disclosure. In otherwords, the present disclosure has been described hereinabove merely asan example, and hence should not be limitedly construed. The gist of thepresent disclosure should be determined with reference to the appendedclaims.

It should be noted that the present technology may take the followingconfigurations.

(1)

A reception apparatus, including:

a communication unit that receives TTML (Timed Text Markup Language)caption data; and

a data processing unit that performs caption display control, the TTMLcaption data being applied to the caption display control, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

(2)

The reception apparatus according to (1) above, in which

the data processing unit performs control of continuing memory storingand display processing of caption data currently displayed when a setvalue of the caption display end time specification attribute of theTTML caption data is the continuous display instruction value.

(3)

The reception apparatus according to (1) or (2) above, in which

the data processing unit performs control of continuing memory storingand display processing of caption data currently displayed when a setvalue of the caption display end time specification attribute of theTTML caption data is the continuous display instruction value even afterreproduction end timing of an image file, the image file being asynchronization processing file for the TTML caption data.

(4)

The reception apparatus according to any one of (1) to (3) above, inwhich

the data processing unit performs switching processing of TTML captiondata files to be processed while continuing memory storing and displayprocessing of caption data currently displayed when a set value of thecaption display end time specification attribute of the TTML captiondata is the continuous display instruction value.

(5)

The reception apparatus according to any one of (1) to (4) above, inwhich

the data processing unit

-   -   determines whether or not there is a text string stored in a        memory, display processing of the text string being continued,        when a set value of the caption display start time specification        attribute of the TTML caption data is the continuous display        instruction value,    -   determines, when there is the text string, whether or not a text        string for which the continuous display instruction value is set        is the same as the text string stored in the memory, display        processing of the text string being continued, and    -   performs, when they are the same, control of continuously        displaying a displayed text.

(6)

The reception apparatus according to any one of (1) to (5) above, inwhich

the data processing unit

-   -   determines whether or not there is a text string stored in a        memory, display processing of the text string being continued,        when a set value of the caption display start time specification        attribute of the TTML caption data is the continuous display        instruction value, and    -   performs, when there is not the text string, control of        immediately displaying a text string for which the continuous        display instruction value is set.

(7)

The reception apparatus according to any one of (1) to (6) above, inwhich

the caption display end time specification attribute of the TTML captiondata is one of an end attribute and a dur attribute.

(8)

The reception apparatus according to any one of (1) to (7) above, inwhich

the caption display start time specification attribute of the TTMLcaption data is a begin attribute.

(9)

A transmission apparatus, including:

a data processing unit that generates TTML (Timed Text Markup Language)caption data; and

a communication unit that transmits the TTML caption data, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

(10)

The transmission apparatus according to (9) above, in which

the data processing unit generates the TTML caption data having a setvalue of the caption display end time specification attribute, the setvalue being the continuous display instruction value, and

the TTML caption data is data that is capable of causing a captiondisplay processing performing apparatus to continue memory storing anddisplay processing of caption data currently displayed.

(11)

The transmission apparatus according to (9) or (10) above, in which

the data processing unit generates the TTML caption data having a setvalue of the caption display end time specification attribute, the setvalue being the continuous display instruction value, and

the TTML caption data is data that is capable of causing a captiondisplay processing performing apparatus to perform switching processingof TTML caption data files to be processed while continuing memorystoring and display processing of caption data currently displayed.

(12)

The transmission apparatus according to any one of (9) to (11) above, inwhich

the data processing unit generates the TTML caption data having a setvalue of the caption display start time specification attribute, the setvalue being the continuous display instruction value, and

the TTML caption data is data that is capable of causing a captiondisplay processing performing apparatus

-   -   to determine whether or not there is a text string stored in a        memory, display processing of the text string being continued,    -   to determine, when there is the text string, whether or not a        text string for which the continuous display instruction value        is set is the same as the text string stored in the memory,        display processing of the text string being continued, and    -   to perform, when they are the same, control of continuously        displaying a displayed text.

(13)

The transmission apparatus according to any one of (9) to (12) above, inwhich

the data processing unit generates the TTML caption data having a setvalue of the caption display start time specification attribute, the setvalue being the continuous display instruction value, and

the TTML caption data is data that is capable of causing a captiondisplay processing performing apparatus

-   -   to determine whether or not there is a text string stored in a        memory, display processing of the text string being continued,        when a set value of the caption display start time specification        attribute of the TTML caption data is the continuous display        instruction value, and    -   to perform, when there is not the text string, control of        immediately displaying a text string for which the continuous        display instruction value is set.

(14)

The transmission apparatus according to any one of (9) to (13) above, inwhich

the caption display end time specification attribute of the TTML captiondata is one of an end attribute and a dur attribute.

(15)

The transmission apparatus according to any one of (9) to (14) above, inwhich

the caption display start time specification attribute of the TTMLcaption data is a begin attribute.

(16)

A data processing method executed by a reception apparatus, including:

receiving, by a communication unit, TTML (Timed Text Markup Language)caption data;

performing, by a data processing unit, caption display control, the TTMLcaption data being applied to the caption display control, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

(17)

A data processing method executed by a transmission apparatus,including:

generating, by a data processing unit, TTML (Timed Text Markup Language)caption data; and

transmitting, by a communication unit, the TTML caption data, in which

the TTML caption data is data having a continuous display instructionvalue for performing continuous display processing of captions currentlydisplayed, as a value of at least one of a caption display end timespecification attribute and a caption display start time specificationattribute.

A series of processes described in the specification can be executedwith hardware, software, or a composite configuration of the hardwareand the software. In order that the processes can be executed with thesoftware, programs, which store a sequence of the processes and areinstalled in a memory in a computer incorporated in dedicated hardware,are executed. Alternatively, the programs to be executed may beinstalled in a general purpose computer capable of executing variousprocesses. For example, the programs may be recorded in advance in arecording medium, or may be installed from the recording medium to thecomputer. Alternatively, the programs may be received via networks suchas a LAN (Local Area Network) or the Internet, and then installed torecording medium such as a built-in hard disk.

Note that, the various processes described in the specification need notnecessarily be executed in time series according to the description, andmay be executed in parallel or individually in accordance withprocessing capabilities of apparatus that execute the processes or asappropriate. Further, the “system” in this specification refers to alogical collective configuration of a plurality of apparatus, and thoseapparatus having respective configurations are not necessarily providedin the same casing.

As described above, according to the configuration of an embodiment ofthe present disclosure, it is possible to achieve an apparatus and amethod that are capable of performing reliable display control with nocaption display interruption of the TTML caption data.

Specifically, when a set value of the caption display end timespecification attribute of the TTML caption data is the continuousdisplay instruction value, memory storing and display processing ofcaption data currently displayed are continued even after reproductionend timing of an image file, and switching processing of TTML captiondata files is performed. In the case where a set value of the captiondisplay start time specification attribute of the TTML caption data isthe continuous display instruction value, whether or not there is a textstring of which display processing is continued is determined, and whenthere is the text string, the displayed text is continuously displayed.

According to this configuration, it is possible to achieve displaycontrol with no caption display interruption of the TTML caption data.

DESCRIPTION OF REFERENCE NUMERALS

10 communication system, 20 transmission apparatus, 21 broadcastingstation, 22 data delivery server, 30 reception apparatus, 31 TV, 32 PC,33 portable terminal, 50 image data, 60 audio data, 70 caption data, 80signaling data, 101 to 103 image/audio data file, 111 to 113 captiondata file (TTML), 121 to 123 caption data file (TTML), 141 dataprocessing unit, 142 communication unit, 143 storage unit, 151 dataprocessing unit, 152 communication unit, 153 storage unit, 154 inputunit, 155 output unit, 201 image data processing unit, 202 audio dataprocessing unit, 203 caption data processing unit, 204 control signalgeneration unit, 205 packet generation unit, 301 packet demultiplexingunit, 302 image data processing unit, 303 audio data processing unit,304 caption data processing unit, 305 control unit, 306 superimpositionprocessing unit, 401 CPU, 402 ROM, 403 RAM, 404 bus, 405 input/outputinterface, 406 input unit, 407 output unit, 408 storage unit, 409communication unit, 410 drive, 411 removable medium

The invention claimed is:
 1. A reception apparatus, comprising:processing circuitry configured to: receive plural Timed Text MarkupLanguage (TTML) caption data files, each TTML caption data file of theplural TTML caption data files includes a value of at least one of acaption display end time specification attribute, a duration timespecification attribute, and a caption display start time specificationattribute; perform a caption display control with a first TTML captiondata file of the plural TTML caption data files; determine whether thecaption display end time specification attribute or the duration timespecification attribute of the first TTML caption data file includes atext string that indicates a continuous display instruction valueinstead of a time value; in response to a determination that the captiondisplay end time specification attribute or the duration timespecification attribute includes the continuous display instructionvalue, the processing circuitry is configured to: perform continuousdisplay processing of first caption data associated with the continuousdisplay instruction value until a next TTML caption data file isprocessed.
 2. The reception apparatus according to claim 1, wherein theprocessing circuitry continues to perform the continuous displayprocessing of the first caption data, after reproduction end timing ofan image file, the image file being a synchronization processing filefor the TTML caption data file.
 3. The reception apparatus according toclaim 1, wherein the processing circuitry determines whether thecontinuous display processing of the first caption data is performed,when a set value of the caption display start time specificationattribute of a second TTML caption data file of the plural TTML captiondata files is a continuous display instruction value for performingcontinuous display processing of second caption data captions,determines, when the continuous display processing of the first captiondata is performed, whether or not the first caption data is the same asthe second caption data, and performs, when the first caption data andthe second caption data are the same, control of continuously displayingthe first caption data.
 4. The reception apparatus according to claim 1,wherein the processing circuitry determines whether the continuousdisplay processing of the first caption data is performed, when a setvalue of the caption display start time specification attribute of asecond TTML caption data file of the plural TTML caption data files is acontinuous display instruction value for performing continuous displayprocessing of second caption data, and performs, when the continuousdisplay processing of the first caption data is not performed, controlof immediately displaying the second caption data.
 5. The receptionapparatus according to claim 1, wherein the caption display end timespecification attribute of the TTML caption data files is one of an endattribute and a duration attribute.
 6. The reception apparatus accordingto claim 1, wherein the caption display start time specificationattribute of the TTML caption data files is a begin attribute.
 7. Atransmission apparatus, comprising: processing circuitry configured to:generate plural Timed Text Markup Language (TTML) caption data files,each TTML caption data file of the plural TTML caption data filesincludes a value of at least one of a caption display end timespecification attribute, a duration time specification attribute, and acaption display start time specification attribute; and transmit theplural TTML caption data files, wherein a reception apparatus receivingthe transmitted TTML caption data files is configured to in response toa determination that the caption display end time specificationattribute or the duration time specification attribute of a first TTMLcaption data file of the transmitted TTML caption data files includes atext string that indicates a continuous display instruction valueinstead of a time value, control continuous display processing of firstcaption data associated with the continuous display instruction valueuntil a next TTML caption data file is processed.
 8. The transmissionapparatus according to claim 7, wherein the processing circuitrygenerates the first TTML caption data file having the caption displayend time specification attribute being the continuous displayinstruction value.
 9. The transmission apparatus according to claim 7,wherein the processing circuitry generates a second TTML caption datafile of the plural TTML caption data files having the caption displaystart time specification attribute being the continuous displayinstruction value for performing continuous display processing of secondcaption data, and the second TTML caption data file causes the receptionapparatus to determine whether the continuous display processing of thefirst caption data is performed, determine, when the continuous displayprocessing of the first caption data is performed, whether or not thefirst caption data is the same as the second caption data, and perform,when the first caption data and the second caption data are the same,control of continuously displaying the first caption data.
 10. Thetransmission apparatus according to claim 7, wherein the processingcircuitry generates a second TTML caption data file of the plural TTMLcaption data files having the caption display start time specificationattribute being the continuous display instruction value for performingcontinuous display processing of second caption data, and the secondTTML caption data file causes the reception apparatus to determinewhether the continuous display processing of the first caption data isperformed, and perform, when the continuous display processing of thefirst caption data is not performed, control of immediately displayingthe second caption data.
 11. The transmission apparatus according toclaim 7, wherein the caption display end time specification attribute ofthe TTML caption data files is one of an end attribute and a durationattribute.
 12. The transmission apparatus according to claim 7, whereinthe caption display start time specification attribute of the TTMLcaption data files is a begin attribute.
 13. A data processing methodexecuted by a reception apparatus, comprising: receiving, by processingcircuitry of the reception apparatus, plural Timed Text Markup Language(TTML) caption data files, each TTML caption data file of the pluralTTML caption data files includes a value of at least one of a captiondisplay end time specification attribute, a duration time specificationattribute, and a caption display start time specification attribute;performing, by the processing circuitry, a caption display control witha first TTML caption data file of the plural TTML caption data files;determining whether the caption display end time specification attributeor the duration time specification attribute of the first TTML captiondata file includes a text string that indicates a continuous displayinstruction value instead of a time value; in response to adetermination that the caption display end time specification attributeor the duration time specification attribute includes the continuousdisplay instruction value, performing continuous display processing offirst caption data associated with the continuous display instructionvalue until a next TTML caption data file is processed.
 14. A dataprocessing method executed by a transmission apparatus, comprising:generating, by processing circuitry of the transmission apparatus,plural Timed Text Markup Language (TTML) caption data files, each TTMLcaption data file of the plural TTML caption data files includes a valueof at least one of a caption display end time specification attribute, aduration time specification attribute, and a caption display start timespecification attribute; and transmitting, by the processing circuitry,the plural TTML caption data files, wherein a reception apparatusreceiving the transmitted TTML caption data files is configured to inresponse to a determination that the caption display end timespecification attribute or the duration time specification attribute ofa first TTML caption data file of the transmitted TTML caption datafiles includes a text string that indicates a continuous displayinstruction value instead of a time value, control continuous displayprocessing of first caption data associated with the continuous displayinstruction value until a next TTML caption data file is processed.